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pDG2: 

G TT AACT ACG TCAGGTGG CACTTTTCGGGG AAATGTG 
TGTATCCGCTCATGAGACAATAACCCTGATAAATGCTT^ 
CGTGTCGCCCTTATTCCCTTTTTTGCGGCATTTTGCCTrCCTGTT^ 
TGCTGAAGATC^GTTGGGTGCACGAGTGGGTTACATCGAACTGGATCTC^ 

CCGAAGAACGTTCT CCAATGATG AGCACTTTTAAAGTTCTG CTATGTGGCG CGGT ATTATC CCGTGTTG ACGCCGGGCAA 
GAGCAACTCGGTCGCCGCATA^CTATTCTCAGAATGACrTGGTTGACT 

TGGCATG ACAG TAAGAGAATTATGCAGTGCTGCCATAACCATGAGTGATAA CACTGCGG CCAACTT ACTTCTG ACAACG A 

TCGGAGG ACCG AAGG AGCT AACCGCTTTTTTC CACAACATGGGGGATC^ TTGGG AACCGG AG 

CTGAATGAAGCCATACCAAACGACGAGCGTGAC^CCACGATGCCTGTAGCM 

TGGCGAACTACTTACTCTAGCTTCCCGGCA^CAATTAATAGACTGGATGGA 

GCTCGGCCCrrCCGGCTGGCTGGTTTATTGCTGATAAATCTGGAGCCGGTC 

CTGGGGCC^GATGGTAAGCCCTCCCGTATCGTAGTTATCTACACt^ 

ACAGATCGCTGAGATAGGTGCCTCACTGATTAAGCATTGGTAACTGTCAGACCAA^ 

ATTTACCCCGGTTGATAATCAGAAAAGCCCCAAAAAOU^^ 

TTTTGTTAAAATTCGCGTTAAATTTTTGTTAAATCAGCTCA 

AAATCAAAAGAATAGCCCGAGATAGGGTTGAGTGTTGTTCCAGTTTGGAA 

CAACGTCAAAGGGCGAAAAACCX5TCTATCAGGGCGATGGCC 

CGAGGTG C CGT AAAG CACT AAATCGG AACCCTAAW3GGAGCCCCCG ATTTAGAG CTTG ACGGGG AAAGCG AACGTGG CG A 
G AAAGG AAGGGAAG AAAGCG AAAGG AG CGGG CG CT AGGG CG CTGGCAAGTGT AG CGGT CACG CTG CG CG T AAC CAC CACA 
CCCGCCGCGCTTAATGCGCCGCTACAGGGCGCGTAAAAGGATCTAGGTGAAGATCCTrTT^ 

TC C CTT AA CG TG AGTTTT CG TTCCACTG AG CGTCAG ACC C CGTAG AAAAG A T CAAAGG AT CTT CTTG AG AT C CTTTTTTT 
CTGCGCGTAATCTGCTCCTTGCAAACAAAAAAACC^ 

TCTTTTT CCG AAGGT AACTGG CTTCAGCAG AG CG CAGATACCAAATACTGTTCTTCTAG TGTAGCCG TAG TTAGG CCACC 
ACTTCAAGAACT CTGT AGCACCG C CT ACATACCTCGCTCTG CT AATCCTGTTACCAGTGG CTG CTG CCAGTGG CG AT AAG 
TCX3TGTCTTACCGGGTTGG ACTCAAG ACGATAGTTACCXX^ CTG AACGGGGGGTTCGTG CAC 

AGAGCCCAGCTTGGAGCGAACGACCTACACCGAACT 

AAGGGAGAAAGGCGGACAGGTATCCGGTAAGCGGCAGGGTCGGAACAGGAGAGCGCAC 

G CCTGGTATCTTTATAGTC CTGTCGGGTTTCG CCACCTCTGA CTTGAG CGT CG ATTTTTGTG ATGCTCG T CAGGGGGGCG 
G AGC CT ATGG AAAAACG CCAG CAACG CGG C CTTTITACXKJTTCCTGGCCTTTTGCTGG C CTTTTG CTCACATG TAATGTG 
AGTTAGCTCACTCATTAGGCACCCCAGGCn^TTACACT 

ACAATTTC^CACAGG AAA(^GCTATG ACCATG ATT ACGCCAAGCT CGG CCG CGTTTAAAC 

AATGTGCTCCTCTTTGGCTTGCTTCCGCGGGCCAAGCCAGACAAG 

AG CGGCGCG CCG AATTCCTGCAGGATTCGAGGGCCCCTCGAGGT 

CAGTCTGGAGCATGCGCITTAGCAGCCCCX3CTGGCACTTGGC^ 

TCCACCGGTAGCXJCCAACCGGCTCCGTTCTTTGGTGGCCCCrrCGC^ 

CCCCCCGCCCrcCAGCTCGCGTCGTGC^GGACGTGACAAATGGAAGTAGCACGTCTCACT 

CA C CG CTG AG CAATGG AAG CGGG T AGG C CTTTGGGG CAG CGG CCAAT AG CAG CTTTG CTCC TTCG CTTT CTGGG CTCAG A 

GGCTGGGAAGGGGTGGGTCCGGGGGCGGGCTCAGGGGCGGGCTCAGGG^ 

GGCATTCTCG CACG CTT CAAAAGCGCACXJTC 

CAATATGGG ATCGGCCATTG AACAAGATGGATTG CACX3CAGGTTCT CC^ CT ATTCGG CT ATG 

ACTGGGCACAACAGACAATCGGCTGCTCTGATGCCGCCXJTGTTCCGGCTC 

AAGACCGACCTGTCCGGTGCCCTGAATGAACTGCAGGACCAGGCAGCX? 

TTG CG CAG CTGTG CT TO ACGTTGTCACTG AAGCGGGAAGGG ACTGGCTGCT ATTGGGCG AAGTG CCGGGG CAGG AT CT CC 

TGTCATCTCACCTTGCTCCTGCCGAGAAAGTATCCATC^^ 

ACCTGCCC^TTCGACCACCAAG(XAAACATCGCA.TCGAGCGAGCA 

TGATCTGGACGAAGAGCATCAGGGGCTOTCGCCAGCCGAACTGTTCGCCAGGCTC^GGCGCGCATGCCCG 
ATCTCGT CGTGACCCATGGCGATGCCTG CTTG CCG AATATCATGGTGGAAAATGG CCG CTTTTCT 
GGCCGGCTGGGTGTGGCCWACCGCTATCAGGAC^TAGCGTTGGCrACCCGTGATATTGCTGAA^ 
GGCTGACCGCTTCCTCGTGCTTTACGGTATCGCrc 

TCTTCTGAGGGGATCGATCCGTCCTGTAAGTCTGCAGAAArrGATGATCTATTAAAC^TAAAGATGTCCAC 
AAGTTTTTCCTGTCATACrrTTGTTAAGAAGGGTGAGAACAGAGTACCT 

GTGGGGGTGGGGTGGGATTAGATAAATGCCTGCTCITrACTGAAGGCTCTTTACTATTGCTTTATGA 
TTGGATATCATAATTTAAACAAGCAAAACCAAATTAAG^ 

TCTCTCXSTGGGATCATTGTTTTTCTCTTGATTCCCACTTTGTGGTrCTAAGTACTGTGG 
TAGCCTGAAGAACGAGATCAGCAGCCTCTGTTCCACATACACnTCATTCTCAGTATTGTTTTGCCMG 
CAGAAGCTGACTCTAGATCTGGATCCGGCCAGCTAGGCCGTCGACCTCGAGTGATCAGGTACCAAGGTCCTCGCTCTGTG 
T C CG TTG AG CTCG A CG AC AC AGG AC ACGCAAATT AATT AAGG CCGG CCCGTACCCTCT AGT C AAGGC CTT AAG TG AGT CG 
TATTACGGACTGGCCGTCGTTTTACAACGTCGTGACTGGGA^ 

TCCCCCTTTCGCCAGCrGGCGTAATAGCGAAGAGGCCCGCACCGATCGCCCTTCCCAACAG TTGCGCAGCCTGAATGG CG 
AATGGCGCTTCGCTTGGTAATAAAGCCCGCTTCGGCGGGCTTTTTTTT 




FIGURE 3A 



FIGURE 3B 



pDGi: 

GTTTAATAGTAATCAATTACGGGGT 
CCCGCCTOTCTGACCGCCCAA^ 

CTTTCCAATGACGTCAATGGGTGGAGTATT7ACGGTAAACTG CCCACTTGGCAGTACATCAAGTGT ATCATATG CCAAGT 

ACGCCCCCTATTGACGTCAATGACGGAAAATGGCCCGCCTGGCATTMGCCCAGTACA 

TTGGCAGTACATCTACGTATTAGTCATCGCTATTACCATGGTGATGCGGTTTTGG^ 

GGTTTGACTCACGGGGATTTCCAAGTCTCCACCCCATTGLRCGTCAATGGGAG 

TTTCCAAAATGTCGTAACAACTCCGCCC 

AG CTGGTTTAGTG AA CCGT CAG ATCCGCTAGCG CTACCGGTCG CCACCATGGTGAGCAAGGG CG AGG AGCTG TTCAC CGG 

GGTGGTGCCCATCCTGGTCGAGCTGGACGGCGACGTAAACGGCCACAAGTTCAGCGTGTCC^ 

CCACCTACGGCAAGCTGACCCTGAAGTTCATCTGCACCACCGCCAAGC^ 

CTGACCTA CGG CGTGCAGTGCTTCAGCCGCTACCCCGACCACA.TGAAGCAGCACGAC^ 

AGGCTACGTCCAGGAGCGCACCATC T TCTTCAAGGACGACX& 

ACACCCTGGTG AACCG CATCGAG CTG AAGGGCATC^CTTCAAGGAGGACGGCAACATCCTGGGG CACAAG CTGGAG T AC 

AACTACAACAGCCACAACGTCTATATCATGGCCGACAAG CAGAAGAACGGCATCAAGGTG AACTTCAAG ATCCG C CACAA 

CATCGAGGACGGCAGCGTGCAGCTCXJCCGACCACTACCAGCAGAACACCCCCA^ 

ACAACCACTACCTGAGGACCCAGTCCGCCCTGAGCAMGACCCCAACGAGAAGC^ 

GTGACCGCCGCCGGGATCACTCTCGGCATGGACGAGCTGTACAAGTCCGGACT 

CATAATCAGCCATACCACATTTGTAGAGGTTTTACTTGCTI^ 

AAATGAATGCAATTGTTGTTGTTAACTTGTTTATTGC^ 

ACkAATAAAGCATT7TTTTCACTGCATTCTAGTTC 

GGTGGCACTTTTCGGGGAAATGTGCGCXK3AACCCC^ 

GAGACAATAACCCTGATAAATGCTTCAATAATATTGAAAAAGGA^ 

TTCCCTTTTTTGCGGCATTTTGCCTTCCTGTTTTTGCr 

TTGGGTGC^CGAGTGGGTTACATCXxAACTGGATCTCAACAGCGGT 

TCCAATGATGAGCACTTTTAAAGTTCTGCTATGTGGCGCGGTAT^ 

GCCGCATACACTATTCTCAGAATGACTTGGTTGAGTACTCACC^^ 

AG AG AATTATGCAGTGCTGC CATAACCATGAGTGATAACACTG CGGCCAACTT ACTTCTG ACAACGATCGG AGG ACCGAA 

GGAGCTAACCGCnTTTTTGCACAACATGGGGGATCATGTAACT 

TACCAAACGACGAC^TGACACCACGATGCCTGTAGCAATO 

A CTCT AG CTT C C CGG CAA CAATT AAT AG ACTGG ATGG AGG CGG AT AAAGTTGC^G^ CGCTCGGCCCTTCC 
GGCTGGCTGGTTTATTGCTGATAAATCTGGAGCCGGTGAGCGTGGGTCTCGCG^ 
GTAAGCCCTCCCGTATCGTAGTTATCTACACGACGGGGAGTC^^ 
ATAGGTGCCTCACTGATTAAGCATTGGTAACTGTCAGACCAAGTTTACT 

G AT AAT CAG AAAAGC C C CAAAAA CAGGAAGATTGT AT AAG CAAAT ATTT AAA TTG T AAACG TT AA T AATT T G TT AAAA T T 
CGCGTTAAATTTTTGTTAAATCAGCTCATTTTTTAACCAATAGG 

AGCCCGAGATAGGGTTGAGTGTTGTTCCAGTTTGGAACAftGAGTCCACTATTAAAGAAC^ 
CGAAAAACCGTCTATCAGGGCGATGGCCCACTACGTGAACC^ 

AGCACTAAATCGGAACCCTAAAGGGAGCCCCCGATTTAGAGCTTGACGGGGAAAGCGAACGTGGCG 

GAAAGCGAAAGGAGCGGGCGCTAGGGCGCTGGCAAGTGTAGCGGTCACGCIGCGCGTAACCACC^ 

ATGCGCCGCTACAGGGCGCGTAAAAGGATCTAGGTGAAGATCCITTI^ 

GTTTTCGTTC CACTG AGCGT CAGACCCCGTAGAAAAGATCAAAGG ATCTTCTTG AGATCCTTTTTTT CTG CG CG T AATCT 
GGTGCTTGCAAAC^AAAAACCACCGCrACCAGCGGTGGm 

GT AACTGG CTTCAG CAG AG CG CAGATACCAAATACTGTTCTT CTAGTGTAGCCGTAG TTAGGC CACCACTTCAAG AACTC 
TGTAGCACCGCCTACATACCTCGCTCTGCTAATCCTGrTACCAGTC^ 

GG TTGG ACTCAAG ACX5ATAGTTACCGGATAW3GCGCAG CGGT CGGG CTGAACGGGGGGTTCG TG CA CACAG C C CAG CTTG 
G AGCG AACGACCT ACACCGAACTGAGATACCTACAGCGTG AG CT ATGAGAAAGCG CCACG CTTCCCG AAGGG AG AAAGG C 
GGACAGGTATCCTGGTAAGCGGCAGGGTCGGAACAGGAGAGCXJCACGAG 

ATAGTCCTGTGGGGTTTCGCCACCT^ 

AACGCCAGCAACGCGGCCITTTTACGGTTCCTGGCCTT^ 

ATTAGGCACCCCAGGCTITACACTTTATGCTrCCGGCTCG 

GGAAACAGCTATGACCATGATTACGCCAAGCTACGTAATACGACT 

TTCXSCTTGCTTCCGCGGGCCAAGCC^ 

ATTCCTGCAGGATTCGAGGGCCCCTGCAGGTCAATTCTACCGGGTAGGG 

GCGCTTTAGCAGCCCCGCTGGGACTTGGCGCTACACAAGTGGCC^ 

CCAACCGGCTCCGTTCTTTGGTGGCCCCTTCGCGCCACCTrCTACTCCT 

AGCTCG CGTCGTGCAGG ACGTGACAAATGGAAGTAGCACGTCTCACTAGTCTCGTGCAGATGGACAG CACCG CTG AGCAA 
TG^AAGCGGGTAGGCCTTTGGGG CAGCGGCCAATAGCAGCTTTGCT 
TGGGTCCGGGGGCGGGCTCAGGGGCGGGCTCAGGGGCGGGGCGGGCGCXSAAC^ 
GCTTCAAAAGCGCACGTCTGCCGCGCTGTTCTCCTCrrCCrCATCT 

G CCATTG AACAAG ATGG ATTG CACGCAGGTTCTCCGG CCG CTTGGG TGGAGAGGCT ATTCGG CT ATG ACTGGG CACAA CA 
G ACAATCGG CTG CT CTG ATG CCGCTCGTGTTCCGGCTG TCAG CG CAGGGCCG C CCGGTTCTTTTTGT CAAG AC CG AGGTGT 
CCGGTGCCCTG AATG AACTGCAGG ACGAGG CAGCGCGGCT AT CG TGG CTGG CCACG A CGGG CG TT C CTTG CG CAGCTG TG 
CTCGACGTTGTCACTGAAGCGGGAAGGGACTGGCTGCTATTGGGCGAAGTGCC^ 



TGCTCCTGCCG&GAAAGTATCCATCAT^ 
ACCACCAAGCGAAACATCGCATCGAGCGAGCA^ 
GAGCATCAG<KK5CTCGCX3CCAGCCX^AACTGTTCGCCAGGC^ 
CCATGGCGATGCCTGCTTGCCGAATATCAT^ 

TGGCGGACCGCTATCAGGACATAGCGTTGGCTACCCGTGATATTGCTGAAGAG 
CTCGTGCTTTACGGTATCGCCGCTCCCGATTCGCAGCGC 

T CG AT C CGT C CTG T AAG TCTG CAG AAATTG ATGAT CTATTAAA CAAT AAAGATGT CCACT AAAATGG AAG TTTTT C CTG T 

CATACHTTGTTAAGAAGGGTGAGAACAGAGTACCTACATTTTG^ 

GGGATTAGATAAATGCCTGCTCTTTACTGAAGGCT 



CATTGTTTTTCTCTTGATTCCCACTTTGTGGTTCTAA 

GAGATCAGCAGCCTCTGTTCCACATACACTTCATTCTCAGTATT^ 

TAGATCTGGATCCGGCCAGCTAGGCCGTCGACCT 

ACGACACAGGACACX>CAAATTAATTAA<X?CCGGCCCGTACCCTCT 

CCGTCGTTTTACAACGTCGTGACTTGGGAAAAC^ 

AG CTGG CGTAATAG CGAAGAGGCC CG CACCGATCG CCCTT CCCAACAGTTGCGCAG CCTGAATGG CG AATGG CG CTTCG C 
TTGGTAATAAAGCCCGCrrCGGCGGGC'ril'n-llT 



FIGURE 3B (ContiDUtcd) 
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FIGURE 6 



Annealing Annealing Annealing Annealing 
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FIGURE 7 



Annealing Annealing 
site 1 site 2 



5' 

Flanking 
DNA 



Annealing Annealing 
site 3 site 4 



ne<r 



3' 

Flanking 
DNA 



GFP negative 
screening marker 



pDG4 Vector 



J 



TCCCCACATTGCAAAGCCTACACAAAGATCCCTACCACTGAGCACCGAGGGAGGCATGGCTCAGAACCCC 
AGCAACATGGAGCCCTTGCGTAAACCACTGGTGCCTGTGAAGGGAATCCCACTCATCAAATACTTTGCGG 
AGACAATGGAGCAACTGCAGAACTTCACAGCCTGGCCTGATGATGTGCTCATCAGCACGTACCCAAAGTC 
TGGTACTAACTGGATGAGTGAGATCATGGATATGATCTATCAGGGTGGCAAGCTAGATAAGTGTGGCCGG 
GCCCCCGTCTATGCCCGGATACCCTTCCTTGAGTTCAGCTGCCCAGGGGTCCCCCCAGGTCTTGAAACTC 
TGAAAGAGACACCAGCCCCACGGATCATTAAGACACATCTGCCCTTGTCCTTACTCCCTCAGAGTCTGCT 
GGATCAAAAGATCAAGGTGATCTACGTTGCCCGAAATGCAAAGGATGTGGTTGTCTCCTATTATAACTTC 
TACAAAATGGCCAAGCTGCACCCTGACCCAGGCACCTGGGAAAGCTTCTTGGAGAACTTCATGGATGGGA 
AAGTGTCCTATGGGTCGTGGTACCAGCACGTGAAGGAGTGGTGGGAGCTGAGACGCACTCACCCTGTTCT 
CTATCTCTTCTATGAAGACATGAAGGAGAATCCCAAAAGGGAGATCAAGAAGATTCTAGAGTTTCTGGGG 
CGCTCTCTACCTGAGGAGACTGTGGATTTAATTGTTCACCACACATCCTTCAAGAAAATGAAGGAGAACC 
CCATGGCTAACTACACAACCATCCCAACTGAAGTTATGGACCACACTATTTATCCCTTCATGAGGAAAGG 
TACCATTGGGGACTGGAAAAATACCTTCACTGTAGCCCAGAGTGAGCACTTTGATGCCCACTATGCCAAG 
CTAATGACAGGTTGTGACTTCACGTTCCGCTGTCAAATATGAATTGTGGATATGGCTATACTGGGAACCA 
AGGCAAGCTGACACATCCCCATCATGATCTCAAGAGAAAAATGTGATGTGTTCATATTTGTTGTATGCCT 
AAAGGAAATCTGAGCTAAGAGAATAGGACTGGGATGTAGCTGAGGCAGAGGGTCTTATGAACATGTCAGG 
AAAGCCATCAGTCCTAACACTGAAAAAGAACCTAAAGTACAAACATGCAAAAATAGTAAGATAAACTGTA 
TTTTACCTGAACAAATAAATGCCACTGGGAGCTGACTGG 

(SEQ ID NO: 19 ) 



Targeting Vector (5' arm; 200 bp flanking neo insert); 

AGAACCCTGCCCTACTCCACCTCCTCCCCTTTTTGTTTCTGGAGAACAGCCAGTCCTAG 
CACTGTTTCCACTTCCTCCCACTTGGGACACAAAATCTCCAGCTCAAAGACCAATTCTG 
CATTCCCCACATTGCAAAGCCTACACAAAGATCCCTACCACTGAGCACCGAGGGAGG 
CATGGCTCAGAACCCCAGCAACATG 

(SEQ ID NO: 20) 

Targeting Vector (3* arm; 200 bp flanking neo insert): 

AGGGTGGCAAGCTAGATAAGTGTGGCCGGGCCCCCGTCTATGCCCGGATACCCTTCC 
TTGAGTTCAGCTGCCCAGGGGTCCCCCCAGGTATGTGCATGGGGGTGCTAGAGACAA 
GTGGAAAAAGGTAGGACCGGGCCCCAGTTTAACAAAGTTCCTTGTTCAACTTAGGTCT 
TGAAACTCTGAAAGAGACACCAGCCCCA 

(SEQ ID NO: 21) 



FIG. 8 



